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REAL SPECTRAL DATA • pRsS.10 

<?xinIv8rsion-"1.0"7> 

< I-compoundOB version - "2.0"- > 

< compound > \ 200 

< name > LacticAcid < /name > 

< moiecularWeight > 90.08 < /moiecidarWeight > 

< experiment > 
<pH>5.1</pH> 

<tBmpBratureKBlwn> 298.1 50000 </temperatiireKelwin> 

< DSSRatio > 0.533668 < /DSSRatio > 

< concentration > 4.30871 0 < /concentration > 
<tim8stamp>Sept52001</timestamp> \ 

< sourceRD > lacticacid.fid < /sourceFID > 

< magnetlWHz > 400.1 20773 < /magnetMHz > 

< spectralWidthHz > 6006.006006 < /spectralWidthHz > 
</experim8nt> 

< cluster > 206 
2iO-^<protonNumber>3</protonNuniber> ^ 
^y^^<m^'(\^Vi^ri> 1 < /quantification > 
<subCiuster> 
<centerPPII«> 1.321559 </centerPPM> 

214-^ < lorentzianWidtliAdjust > 2.893258 </IofBntzianlWdthAdiiist> 
<paak> 1 

220^^ < of f setCenterHz > -3.455480 < /off setCenterHz > 
222-^ < height> 0.988680 < /height > [ 216 

224-^ <Pfo^o"'^atio> 1-491461 </protonRation> > 204 

</p8ak> J 

<pe8fc> 

< of f setCenterHz > 3.455480 < /offsetCenterHz > 
<height> 1.0000000 </h8ight> \ 218 

< protonRation > 1 .508539 < /protonRatio > 
</peal(> 

</subCluster> 
</cluster> FIG. 7A 
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208 

<ciuster> 

< protonNumbar > 1 < /protonlAimlwr > 

< quantifteatim > 0 < fquantrficatfon > 
<subClust8r> 

< cemerPPM > 4.11 8849 < /centerPPM > 
<lor8nl2i8nWidthAdjust> 1 3.441432 <lorent2ianW{dthAdiust> 
<pe8k> 

< offsetCenterHz>-10.366999 < /offsetCBnterHz> 

< hdeht > 0.057244 < /hstght > 
<protonRatH» 0.130861 </protonRatio> 

</pBak> 
<peak> 

< offsetCenteriti >-3.43221 144 </offsetCenterHz> 

< hBight> 0.164751 </height > 

< protonRatio > 0.376629 < /protonRatio > 
</p8ak> 

<peak> 

<offsBtCfiirtefH2>3.487744</offs8tCanterHz> 

< hatglit > 0.161840 < /height > 

< protoifiatio > 0.3^75 < /protoiriflatio > 
</peal(> 

<peak> 

< off setCentsrHz > 1 0 J66999 < /offsetCenterHz > 
<height>0.053601</heigbt> 

< protonRatio > 0.122535 < /protnflatio > 
</peak> 

</subChister> 
</ciustejr> 

< /confound > -> 



FIG.TB 
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Real Spectral Data • pHs5.45 

<?»nl¥erslon-"1.0"?> 

< I- con^oinulDB ver^-'2.0" -> 

< compound > 

< nain8> L8cticAdd</nane> 

< moiecularWoight > 90.08 </inoIecuiarWsight > 



c experiment > 
<pH>5.45</pH> 

< tem{er8tiifeKdvin > 288.1 50000 < /temperatureKelvh > 

< DSSRatio> 0.S33868 < /DSSRatio > 

< concentr8tion> 4.308710 </concentration> 

< tumstanv > Sept 5 2001 < /timestnnp > 
<sourcenD > iaetie8Gld.fii«/soiirceFIO> 

< magnetMHz > 400.1 20689 < /magnetMHz > 

< spectraiWidthHz > 6008.006008 < /spectralWidthHz > 
</exp8raiient> 



< cluster > 

< protonNumber > 3 < ^rotonNumbw > 

< quantffleatfon > 1< /quantisation > 
<subChBtor> 

< centerPPM > 1.318770< /centerPPM > 

< lorsntzianWidthAdpist > 1.562004 < /torentdanWidthAdjust > 
<peak> 

< offsetCenterHz > -3.453329 < /offsetCenterHz > 

< height > 0.983656 < /height > 

< protonRatio > 1 .487641 < /protonRatio > 
</p8ak> 

<peak> 

< offsetCenterHz > 3.453329 < /offsetCenterHz > 

< height > 1.000000 < /height > 

< protonRatio > 1.51 2359 < /protonRatio > 
</peak> 

</eubCbster> 
</chistBr> 

FIG. 13A 
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< cluster > 

< protonNumber > 1 < /protonNumto > 

< quantificatiin) > 0 < /quantrficaflon > 
<siibCluster> 

< centerPPM > 4.11 0980 < /centsrPPM > 

< lorsntzlanWidthAdjust > 7^502 < /lorentzianWidthAdjust > 
<peak> 

<0ff88tCent8rHi> -10.371082<foffs8tC8nterHz> 

< hnght > 0.054671 < /height > 
<protonRatio > 0.124148 </protonR8tio> 

</p8ak> 
<peak> 

< off set CenterHz > -3.448809 < /offsstCenterHz > 

< height > 0.1 64509 < /height > 

< protonRatio > 0.373572 < /protonRatio > 
</peak> 

<pe8k> 

<offsetCenterHz> 3.456818 </offs8tCenterHz> 

< height > 0. 1 63879 < /height > 

< protonRatio > 0.372141 < /protonRatio > 
</peak> 

<|ieBk> 

<offsetCenterHz> 10.371092 </offsatC8ntarHz> 

< height > 0.057309 < /height > 

< protonRatio > 0.1 30139 < /protonRatki > 
</peak> 

</subCiuster> 
</chi$t8r> 
</coii^nd> 



FIG. 13B 
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Interpolsteil Data • pH«5.28 • 



<7xrolv8rslon-"1.0"?> 

< I- GompoundOB version- "10" -> 
<coinpounfi> 

< name > LacticAcid </nani8 > 

< imriacuiarWdglit > 81108 < MniecularWeigiit > 

<experiinent> 
<pH>5.45<Jj>H> 

<t8mparatureKeMn>298J50WW</t«np8ratiir8Kelvjii> : copied 

< DSSRatio > 0.533688 </DSSRatlo > 

< conc8ntration>4.30B710<A:oiic8iitration> 
<tiin8Stamp> Sept S 2001<Aini8Stainp> 
<sfliircsHO>IaetfC8cid.fid</soiifteHO> .xopiBd 
<.inagnetMH2> 400.120899</nia8n8tMHz> 

< spectraiVlfidthHz> 6006.006006<ispedraiViridt^ > 
< /experiment > 



< cluster > 

< protonNiind)er> 3 < /protonlihintw> 

< qu8i»tlfiC8tJon> 1</quentrficat{on> 
<siAChister> 

<C8iitefPPM > 1.3202 <teeiitflrPPM> : intHfp iihteil 
<lofBn^anWldtbA(|ju$t> U62004<flor8nt2ianWldtWU8ust> 

<offsetCtoterH2>-3.453329</offeetCenlerH2> 

< heiglit > 0.983656 < /heigbt > 

<protonRatio> UB7641 </protonltetlo> 
</peak> 
<peak> 

<crfsBtC«ntefHz>3.453329</off$8tCenterH2> 
<helofrt> J.000000<ihelBlrt> 

<protonR8tio>U12359</protonRatio> 
</p88k> 

<i^ClifSter> 

</cfcist.r> FIG. 14A 
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< cluster > 
<protonNiHnber> 1 <Airotoid4umber> 
< quantification > 0 < /qiuntificatioR > 
<stibCiuster> 

< canterPPM > 4.1 149 < teenterPPM > :intBrpolated 

< lorentzianWidthAiOiist > 7.044502 < /lorentdan Widths 

<p8ak> 

< offset CentarHz > • 1 0 J7 1 082 < /offsatCenterHz > 

< heigfit > 0.05467K flidght > 

< protonRatio > 0.1 24148 < /iRotonRatio > 
</i)88k> 

<peaii> 

< offsetCentsrtlz > -3.448809 < /OffsatCenterHz > 
<lieigiit> 0.164509 < /height> 

< pratonRatio > 0.373572 < /protoiflalio > 
<i|ieak> 

<pe8k> 

< offsBtCenterflz > 3.4568 1 9 < /offsatCenterHz > 
<lt8igiit> 0.163879</belglit> 
<|irotonRatio>a372141</protoiiR8tio> 

</peak> 
<|mik> 

<offsetCent8rHz> 10.371092</offsetCent8rHz> 
<l)eigi>t>0J)57309</beigiit> 

< piotonRatio > 0.1 301 39 < /protonRatto > 
</||ie8k> 

</s^Ctusttf> 
</cluster> 
</confOund> 

FIG. 14B 
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Generic Lactie Add record 



< ?Mnl version - "1 .0"? > 

< I- conqioimdOB v«sion-'2iI" -> 

< compound > 

< naine> LacticAcid</naiw> 

< molecular Weight > 90.08 < McuiarWeiglit > 

<e]qieriment> 
<pH>6.45<4iH> 

< temperatureKeivin > 298.1 50000 </teinperetureKeivin > 

< DSSRatio > 0.533668 </DSSR8tb > 

< concemratlon>4.308710</concBnlr8tion> 

< timestamp > Sept S 2001 </tin)est8nip > 

< sourceFID > lacticaddLftf </soiirceRO > 

< magnetMHz > 400. 120899 </magn8tMH2> 

< spectraiWidthHz > 6006.006008 </spectrelWidthHz > 
<fexpernnein> 



< cluster > 

<protonNu{nber> 3 <|protonNuinber> 
<i|uentification> 1 <;quantiflcation> 
<subChister> 

< centerPPM>y»m,|pW+fcj <ieeBterPPM> 

< lorentzienWidthAd!iiist> 1.562004<ilorentzienWidthAdiust> 
<pe8k> 

< offsetCenterHz > -3.453329 < /off setCeirterHz > 

< iie!glit> 0.9a3656</iielgtt> 
<protonRatlo> 1.487641 </|irotonfletlo> 

</peek> 
<pealr> 

< offsetCenterHz > 3.453329 </offsBlCemefHz > 

< ln}glit> 1.000000<Aaiiiftit> 

< prirtonRatfO 1.512359</|)rotonRatio > 
</peak> 

</8uliCIuster> 

</ch»star> FIG. ISA 
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<ehist8r> 
<protonNunibBr> 1 </protonJ\lumbfir> 

< quantification > 0 < /quant'rftcation > 
<subCluster> 

<c8nterPPM> y»in2(pli)-i-b2</centerPPM> 

< (orefltaanWidtiiAdjust > 7.044502 < AorentzlanWidthAdpist > 
<p«ak> 

< offs8tC6nterHz> -10371092 < /offsstCenterHz > 
<li8iolrt> 0.054671 <ibeiBlit> 

< protonRatio > 0.1 24148 < /protonRatio > 
</pe8k> 

<PbA> 

<0ffsetCenterHz>-3.448909</offs8tCentBrHz> 
<lieiglrt>0.1B4509 </lnight> 

< protonRatio > 0.373572 < /protonRatio > 
</peak> 

< offs8tCenterHz> 3.456819 </offsetG8nterHz > 

< beiglit > 0.1 63878 < /hoght > 

< protonRatio > 0.372141 < /protonRatio > 
</p88k> 

<peak> 

<offs8tCenterHz> 10.371092</offsetCenterHz> 

< heiglit > 0.057309 < /height > 

< protonRatio > 0.1 301 39 < /protonRatio > 
<lfaA> 

</subGiust8r> 
</ciust8r> 
</coinpoiind> 



FIG. 15B 
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Generie Lactic Add record with Loolcup Table Unit 



<?OTilv8fsion-"1.0"?> 

< I- coropoundDB ver5ion«"2.0* - > 
<eonpoond> 

<naim> LacticAeidOhame > 

< molecular Wright > 80.08 < /motecularW^ht > 



< experiment > 
<pH>5.45</pH> 

< tenveretureKehrin > 288.1 SOIXM) < /temperatunKehnn > 

< DSSRatio > 0.533668 < /DSSRatIo > 

< concentration > 4.30871 0 < /concentration > 
<tiinestemp> Sept 5 2001 </tlmestanv> 

< sourceFID > lacticaddfid < /sourceRD > 

< magnatMHz > 400.120699 < /magnetMHz > 

< spectralWidthHz > 6006.006006 < /spectralWidthHz > 
</experimait> 



< cluster > 

< protonNumber > 3 < iprotonN wnber > 

< ipiantifieatkm > 1< /quantffieatlon > 
<stdiCkist8r> 

<centerPPiyi > LOOKUP TABLE LIMKI </ceBterPPiUI > 

< iorentaanWidthAdjust > 1 .562004 < ilorentzianWidth Adjust > 
<pealt> 

<offsetCenterHz>-3.453328 <;offsetCenteiHz> 

< height > 0.983656 < /height > 
<protonRatio> 1.487641 </prDtonRat!0> 

</p8alt> 
<pe8l(> 

<offsetCemerHz> 3.453329< /offsetCenterHz> 

< height > 1 .000000 < /height > 
<protonRatio> 1.512350 </protonRatio> 

</peak> 
</subCkister> 

FIG. 16A 
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< cluster > 
<|)rotonNimtfier> 1 </|)rotonNuni)9r> 
< quantiff cation > 0 < /quantification > 
<siACIuster> 

<centsrPPM> LOOKUP TABLE UHia</c8merPPIM> 

<torenl2ianWidthAiljust>7J)44502</lorantzianWidthAdpist> 

<p8ak> 

< offsetCenterHz > -1 0.371 092 < /offsetCent8fHz> 

< height > 0.054671 </h8ielit> 

< protonRatIo > 0.124148 < liirotonR8tio> 
</pe8lc> 

<peak> 

< OffsetCenterHz > -3.448909 < /offsetCenterHz > 

< height > 0.164509 < /height > 

< protonRatio > 0.373572 < /protonRatio > 
</peal(> 

<p8ak> 

< offsetCenterHz > 3.45681 9 < /offsetCenterHz > 

< height>0.163878</he!ght> 

< protonRatio > 0.372141 < AvotonRatio > 
</il»ak> 

<peak> 

< offsetCenterHz > 10.371092</offsetCenterHz> 

< height > a057309 </bBight> 

< protonRatfo > 0.130139 < /protmRatio > 
</pe8k> 

</subChister> 
< /cluster > 
< /Compound > 



no. 16B 
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PROCESS, APPARATUS. MEDIA AND SIGNALS FOR AUTOMATICALLY 
IDENTIFYING COMPOUNDS IN A SAMPLE 

BACKGROUND OF THE INVENTION 
5 - 

1. ReM of invention 

This invention relates to qualitative and quantitative chemical anat^, and 
more particuiarly to processes, apparatus, media and signals for automatically 
10 identifynDg compounds in a sample. 

2. Description of Related Art 

The field (tf btometric identification has groivn tremendously over tiie recent 
1 5 decade botti from its relevance to medical diagnostics and to its application as 
a way to uniquely identify a person or an animal, for example. As diagnostic 
tools have become more sophisticated, complex liquid mbctures. such as 
human blood or urine for examine, can now be analyzed to identify or search 
for partKular compounds ttiat can provide important diagnostic infonnatton to 
20 a medical technician or a doctor. 

Generally, the separation and characterization of mbctures is fundamental to 
neariy every aspect of analytical chem»try and biochemistry. Most 
approaches to identify and quantify biological compounds in liquid mixtures 

25 require an initial compound separation (chromatographic or physical 
separation} step to separate a particular compound or set of compounds from 
ttie mixture. For example, gas chromatography, electrophoresis, and liquid 
difomatography are used to separate pure chemteal 
components/compounds, for example, from a mixture before analysis Is 

30 performed. Initial compound separation is required because most spectral 
identifteation processes, such as mass spectrometry or infrared, visible, and 
ultravioM spectroscopy, require relatively pure samples in order to minimize 



noise and increase the accuracy of the measuring device. Spectral 
identification processes are expensive, manually intensive and require a great 
deal of technical expertise to be perfonned properly in an accurate, timely 
manner. 

Nuclear magnetic resonance (NMR) has recently been shown to be an 
attemative approach to identify and quantify biological compounds without 
chromatographic separation, in this approach, radio frequency (RF) 
electromagnetic radiation is applied to a mixture of organic compounds to 
extract and measure a characteristic RF absorption spectrum of nuclei 
belonging to each specific organic compound. A large number of compounds 
are associated with well-defined peaks In the absorption spectrum and 
Icnowing which peaks are associated with certain compounds makes it 
possible to manually identify some of the compounds in the liquid mixture 
without resorting first to chromatographic separation. However, this process is 
still quite slow and requires a great deal of a priori infomiatfon that relates 
each peak to a given compound. It can take a number of years for experts in 
NMR spectroscopy to acquire the knowledge required to analyze NMR 
spectra to accurately Wentify and quantify compounds in sample mixtures. 

Therefore what is desired is a process and apparatus for quickly, accurately 
and automatically identifying a number of compounds which may be present 
In complex liquid mixtures without involving chromatographic separatton and 
without requiring people who are experts In NMR techniques. 

SUMMARY OF THE INVENTION 

Overall Process 

The embodiments of the Invention disclosed herein provide for automated, 
accurate analysis of a test spectnim obtained from a sample, to quantitatively 
and qualitatively Mentify compounds present in the sample. 



In accordance with one aspect of the invention there is provided a prx)ces8 for 
producing a trace file for use in spectmm analysis, the process Involving 
perfomning a Fourier Transfonn on Free Induction Decay (FID) data to 
produce an initial spectnjm. filtering a selected region of the initial spectrum to 
produce a filtarad spectrum, and phasing the filtered spectrum to produce a 
measured spectrum having a flat t>aseiine and positive peaks. 

In accordance with another aspect of the invention there is provided a 
computer readable medium for providing codes operable to direct a processor 
circuit to produce a trace file for use in spectrum analysis, the computer 
readable medium comprising codes for automatically causing the processor 
circuit to perform the process described atx)ve. 

In accordance with another aspect of the invention there is provided an 
apparatus for producing a trace fOe for use in spectrum analysis, the 
apparatus comprising means for automatically perfonnlng a Fourier Transfonn 
on Free Induction Decay (FID) data to produce an initial spectrum, means for 
automatically filtering a selected region of the initial spectrum to produce a 
filtered spectrum, and means for automatically phasing the filtered spectrum 
to produce a measured spectnjm having a flat basdine and positive peaks. 

In accordance with another aspect of the Invention there Is provided a signal 
for causing a processor circuit to produce a trace file for use in spectrum 
analysis, the signal including a first segment comprising codes for 
automatically causing the processor circuit to perform the process described 
above. 

In accordance with another aspect of the invention there Is provided a 
computer-implenfiented process for pioduclng a representatfon of a spectmm 
for a hypothetteal solutton containing a compound, for use in determining the 
composition of a test sample, the process involving producing a positfon value 
for at least one peak of a base reference spectmm as a function of a 



measured condition of the test sample, and a property of the at least one peak 
in a base reference spectrurn. 

In accordance with another aspect of the Invention there is provided a 
computer-readat)le medium for providing computer readable Instructions for 
causing a processor circuit to produce a representation of a specbrum for a 
hypothetical solution containing a compound, for use In detemninlng the 
composition of a test sample, the instructions comprising a set of codes for 
directing the processor circuit to perform the computer-implemented process 
described above. 

In accordance with another aspect of the invention there Is provided a signal 
operable to cause a processor circuit to produce a representation of a 
spectrum for a hypothetical solution containing a compound, for use in 
detennining the composition of a test sample, the signal comprising a signal 
segment comprising codes operable to cause the processor circuit to perfomn 
the computer-implemented process as described at)ove. 

In accordance with another aspect of the invention there is provided an 
apparatus for producing a representation of a spectrum for a hypothetical 
solution containing a compound, for use In detennining the composition of a 
test sample, the apparatus comprising a processor circuit programmed to 
produce a position value for at least one peak of a reference spectmm as a 
function erf a measured condition of tiie test sample, and a property of the at 
least one peak in a base reference spectrum, 

In accordance witti another embodiment, Oiere Is provkjed an apparatus for 
producing a representation of a spectrum for a hypothetical solution 
containing a compound, for use In determ.lning the composition of a test 
sample, the apparatus comprising means for recehdng a measured condition 
value representing a measured condition of the test sample, means for 
receiving a representation of a position of at least one peak in a base 
reference spectrum, and means for producing a position value for at least one 



peak of a derived reference spectrum as a function of the measured condition 
of the test sample, and the position of the at least one peak In a base 
reference spectrum. 

Other aspects and features of the present vivention will t}ecome apparent to 
those ordinarily skilled In the art upon review of the foBowIng description of 
specific embodiments of the inventton In conjunction with the accompanying 
Rgures. 

BraEF DESCRIPTION OF THE DRAWINGS 

In drawings which ilhistrats embodiments of the invention. 



Figure 1 



is a system for determining the quantity of compounds in a test 
sample, according to a first embodiment of the invention; 



Figure 2 is a flow chart illustrating an automatic process for conditiomng a 
measured spectrum, as Implemented by a workstation shown in 
Figure 1; 

Figure 3 Is a pictorial representation of a measured spectmm produced 
by the workstation shown in Figure 1 ; 

F^re4 is a flow chart of a routine executed on the wofkstatk>n shown in 
Rgure 1, for conditioning the measured spectrum to suppress a 
peak caused by a solvent in a sample for whteh the measured 
spectrum Is produced; 

Figure 5 is a flow chart of a process for Mentifying compounds executed by 
a spectrum analysis apparatus shown in Figure 1; 

Figure 6 is a pictorial representation of a reference spectnjm associated 
with lactk: acid at pH of 5.10; 

F^ures 7A and 7B are a tabular representation of an Extensible Markup 
Language (XML) file representation of the reference spectmm of 
Figure 6; 

Figure 8 is a flow chart of a process by which base reference spectrum 
records such as shown in Figures 7A and 7B may be produced; 

Figures is a process executed by the spectrum analysis apparatus 
shown in Figure 1 to kientify a peak associated with a calibration 
compound in a test spectrum; 



Figures 10A and 10B are a flow chart of the process for identifying 
compounds, shown in Figure 5. in greater detafl; 

Figure 11 Is a flow chart of a process executed by the spectnim analysis 
apparatus for detemtining a pH value from the test spectnim; 

Figure 12 Is a flow chart of a process executed by the spectmm analysis 
apparatus for producing a derived reference spectnim; 

Figures 13A and 13B ara a tabular representation of a base reference 
spectmm record associated with lactic acid at a pH of 5.45; 

Figures 14A and 14B are a tabular representation of a derived reference 
spectrum record associated with lactic acid at a pH of 5.28; 

Figures ISA and 15B are a tabular representation of a generic type of derived 
reference record in which equations specify center Parts Per 
Millton (PPM) values for peak clusters, according to one 
embodiment of the invention; 

Figures ISA and 168 are a tabular representation of a derived record 
comprising lool<-up table links to center PPM values according 
to another embodiment of the invention: 

Rgure 17 is a flow chart of a process for detennining an upper bound 
concentration estimate: 

Figure 18 is a flow chart of a feast squares fitting routine referenced by 
Figure 108; 



DETAILED DESCRIPTION 

Referring to Figure 1. a system, according to a first embodfrnent of the 
invention, for detemtining the quantity of compounds in a test sampta is 
shown generaliy at 10. The system includes a spectrum producing apparatus 
5 12 and a spectrum analysis apparatus shown generally at 14. in this 
emkxxliment, the spednim producing apparatus 12 is a Nuclear Magnetic 
Resonance (NMR) System provMed by Varian (RTM) inc. of CalHbmia, U.SA 
Generally, the system is operable to receive a specially pr^red liquid 
biological test sample and produce a data file comprised of a plurality of (x.y) 
10 values which define a measured NMR spectrum. This measured NMR 
spectrum is then supplied to the spectrum analysis apparatus 14, where a 
process according to another aspect of the invention is carried out to provide 
an indication of the quantities of certain compounds in the specially prepared 
biological test sample. 

15 

The system 10 is suitable for use with biological samples, for example blood 
or urine, in which the solvent is water, for example. Such samples may be 
"prepared" by doping them with a small quantity of a condition indicator 
compound, also refen^ed to as a condition reference compound, and a 

20 chemically inert chemical shift calibration standard compound also referred to 
as a calibration compound. The condition indicator may be trimethylsilyl-1- 
propanoic acid or Imidazole, where the distortion fector is pH, for example. 
Altemath^ely, the sample itself may have a naturally occurring, inherent 
condition indicator such as glycine, creatinine, urea, citrate, or trimethylamine- 

25 N-oxide, for example. The chemical shift calibration standard compound may 
be 3-(trimeth^sllyi]-1-propane$utfonic add, also known as DSS, for example. 
Alternatively, the chemical shift calibration standard may be 
dimethylsulphoxide (DMSO), acetone, or tetramethylstlane (IMS), for 
exam|:rfe. 
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In this embodimwt, the spedmrn producing apparatus 12 is comprised of a 
computer woricstatton 16. an auto sampler 18, a test chamber 20, and a 



console 22. The workstation is a Sun Workslation with a 400 MHz 
UltraSPARC lii CPU with 2 IWB level 2 cache. 128 MB RAM. on-k)oard PGX24 
graphics controller. 20 GB 7200 r.p.m. ElOE hard disk 48x CD-ROM drive 
1.44 MB floppy drive and ir flat screen color monitor. The workstation runs 
Varlan (RTM) VNMR software which includes routines for controlling the auto 
sampler 18 and the console 22 to cause the specially prepared titotogicell 
liquid sample to be received in the test chamber 20 and to cause the console 
to acquire and provMe to the workstation Free Inductton Decay (FID) data 
representing the free induction decay of electromagnetk: radiatton absorpttons 
produced by protons In the compounds of the liquM sample as a result of 
changes in magnette properties of the protons due to a nudear magnetic 
resonance process initiated in the test chamber 20 by the console 22. 

Process for Producino a Miwy mied Sr>flctitim 

The FID data is received and stored In memory at the worfcstatkm 16. Then. In 
this embodiment, a process according to an embodiment of another aspect of 
the Inventton. is carried out to cause the workstation to produce a measured 
spectmm fbr use by the spectnim analysis apparatus 14. Instructions for 
direptlng the workstation to automatically cany out the process for producing 
the measured spectnim are embodied in computer readable codes 24. These 
computer readable codes 24 may be provMed to the workstatfon 16 in a 
variety of different fbnns including a file or files on a computer readable 
medium such as a CD-ROM 26. or floppy disk 28. for example, or as a file 
received as a signal frorn a cbmmunicatfons medium such as an internet 30, 
extranet or intranet, electrical 32. Radio Frequency (RF) 34. or optfcal medium' 
38 or any other medium by which a file comprised of said codes may be 
provkled to the workstation 16 to enable the workstation to be directed by the 
codes to execute the process described herein to produce a measured 
spectrum. 



Autoprocessipfl 

Generally, an automatic computeMmplemented process for produdng a 
measured spectrum from NMR data, may involve operating on free induction 
decay (FID) data produced by a spectrometer to produce a trace file 
comprised of intensity and frequency values representing a measured 
spectrum having a flat baseline and well defined peaks that have positive, 
well-defined areas, for use in a computer-implemented spectrum analysis 
process such as the process described herein. In particular, the process may 
involve perfomiing a Fourier Transfonm on Free Induction Decay (FID) data to 
produce an Initial spectnim, filtering a selected region of the initiai spectmm to 
produce a filtered spectrum and phasing the filtered spectarum to produce a 
measured spectrum having a flat t)aseline and welt deflned positive pealcs. 

Refem^ng to Figure 2, a flowchart depicting functional tdocks implerTmnted by 
the codes to cause th9 woricstation to execute a specific process for 
producing a measured spectmm is shown generally at 50. The process 
begins with a first block 52 that causes the workstatton 16 to read and perform 
an initial Weighted Fourier Transfomi on the FID data to produce an Initiai 
measured spectmm representing signal intensity (i) versus frequency (F). 

Then block 54 causes the woricstation 16 to produce parameters for use in a 
later-executed Fourier Transform perfbnned on the FID data to prcxJuce a 
representation of a measured spectmm having well defined Lorentzdan lines 
with a flat baseline and peaks that have positive, welt-defined areas. Thus, 
the result of block 54 is a set of parameters that controls Fourier Transforms 
later performed on the FID data to produce a representation of a measured 
spectmm. 

Block 56 directs the woricstation to save the set of parameters in association 
witti the FID data. Block 58 directs the woricstation 16 to perfonn.a Fourier 
Transfonm on the FID data, using ttie parameters produced by block 54 to 
produce a trace file, which is a file comprised of a plurality of (x,y) values that 



represent a trace of the measured spectrum, representing Intensity versus 
fiBquency. Block 59 then caus^ the workstation to save the trace fRe for 
transmlsston to the spectmrn analysis apparatus 14 shown in Figure 1. 

An example of a measured spectrum is shown generally at 41 in Figure 3 
The spectrum Is a ptot of Intensity versus frequency. The x^xls 43 Is 
referenced to parts per mlHton (ppm) and depicts a window of the overall 
spectmm. the window containing relevant Infbmiatlon or features for 
Identifying compounds In the sample. The y-axis 39 Is referenced to a zero 
value and the spectmm has a baseline 37 iBpresenting a noise level from 
which a plurality of peaks 45. 47. 49. 51. 53. 55. 57. 59. 51. 63 associated 
with various compounds m the sample extend. For example peaks 45 and 47 

are associated with ImWazole. peak 49 Is associated With Urea and peaks 51 
and 53 are associated with Creatinine. Peaks 55 and 57 fomi a first cluster 
associated wHh citric acW and peaks 59 and 61 fom, a second cluster 
assoaated with that compound. Peak 63 Is associated with DSS. the 
calltHatton oonqxjund. 

Referring back to P^re 2. bkx* 54 which processes the FID data. Is shown 
.n greater detail. Block 54 Includes sub-fiinctional blocks Including a Fourier 
Transfbm, btock 60. a filter selected and/or solvent region btock 66 and an 
automatte phasing Wock 68. each of which is automatically executed in turn 
in the order shown. The process may include an optional spectral window 
setting btock 62 and an optional drift conectfon block 64, to further process 
the spectmm. for example. 

The Fourier Transfomi block 60 has an optfonal sub-block 70 that causes the 
workstation to perfomi a weighted Fourier Transfbmi with weights that provkle 
for enhancement of the Initial spectmm. These weights may perfomi a line 
broadening function to the initial spectrum, for example. To do this in this 
embodiment, bkx* 70 causes the woricstatfon to set signal enhancement 
parameters for use in a subsequently executed weighted Fourier Transfom, 



block 72. Such signal enhancement parameters may effect line broadening, 
line narrowing, or gaussian sine-bell conditioning, for example, to the resulting 
spectnim produced by the Fourier Transform block 72. In the Varian (RTM) 
VNMR software, this is effected by setting a line broadening variable "lb* to a 
5 specified value, which may be 0.5, for example. Also in the VNMR sofhrare, 
the weighted Fourier Transform may be executed by calling the VNMR macro 
"Wfr to perfonn a weighted Fourier Transfonn on the FID data, using the lb 
parameter value set at block 70. This has the effect of broadenir^ the lines or 
peaks of the spectrum and averaging the spectrum to produce a measured 
10 spectrum with a better signal to noise ratto than wouki be produced without 
averaging. It also has ttie effect of eliminating glitches to produce a measured 
spectrum of better quality. 

In this embodiment optional bkick 62 causes the workstation 16 to define a 
15 window on the initial spectrum and this nnay involve scaling tiie initial 
spectrum, it is desirable to set the spectral window to a preset size, i.e. a 
pre-defined range of frequency, to enable the acquisition of repeatable data 
and for all useful data to be in a pre-defined window and to scale ttie 
spectmm such that the height of its maximum peak is a percentage of tiie 
20 height of tiie window. In this embodiment, tills is effected through ttie VNMR 
software by executing tiiree sub-functional blocks 74, 76 and 78 ttiat cause 
ttie workstation 16 to call ttie VNMR macros T, 'Yuir, and tiie VNMR 
command "Vsadf , respectively, in the order shown. The T macro sets display 
parameters "sp** and '*wp* for a full display of a ID spectnim, tiie luir macro 
25 sets display limits for a full screen so that the spectmm can be seen as wide 
as possible in the window, and the VsadJ' command sets up automatically the 
vertical scale 'Vs" in ttie absolute intensity mode "ai*, so tiiat ttie largest peak 
is of tiie required he^ht Effectively this provides for scaHng of ttie spedrum 
so ttiat ttie highest peak is 90% of ttie total window height 
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Optional block 64 causes ttie workstation to produce parameters tiiat perfomi 
drift conBction on the spectrum to correct ttie measured spectmm for drift 



effects, effectively setting the two extremes of the baseline of the spectrum. 
i.e. the left and right sides of the spectrum to have zero stope. In this 
embodiment, using the Varian (RTM) \/mR softiware. this is achieved by 
blodt 80 which causes the workstation 16 to call the "dc" macro of the VNMR 
software. Effectively the "dc" macro calculates a linear baseline conectlon. 
The beginning and end of a straight line to be used for baseline correction are 
detennined from the display parameters "sp" and ^tp'. The "dC command 
applies this connection to the spectrum and stores the definition of the straight 
line in the parameters Tvl" (level) and "tlP (tUt) of the VNMR software, (cdc 
resets the parameters "IvT and HT to zero.) 

Biodc 66 causes the workstation to filter a selected region of the spectnim to 
adjust the intensity of the spectnjm in that region, Filtering may involve 
applying a notch filter to a selected or soTvent region, for example, to suppress 
a peak associated with a contaminant or solvent in the contaminant or solvent 
regton. This ensures that the solvent regton or contaminant region of the 
spectmm is correctly phased with the rest of the spectnim so that the entire 
spectrum can be property phased later. In order to pemiit the entire spectwm 
to be phased, the solvent or contaminant residual must be in phase with the 
rest of the spectmm. Meally reducing the solvent or contaminant region to 
zero. The sclent regkm is the regbn of the spectnim in which solvent 
compounds in the sample may be found. For example the solvent may be 
water, in whteh case the regbn around the peak in the measured spectnim 
associated with the compound H2O is considered to be the solvent region. 
The contaminant regton is a regton of the spectnim where pealcs associated 
with contaminants are present 

Referring to Figure 4. a routine for filtering the selected regton is shown 
generally at 66 and involves a first block 92 that causes the woricslatton 16 to 
apply a notch filter to the selected regton to suppress a peak in that regton. A 
set of initial notch filter parameters specifying the attenuatton. width and 
positton of the notch filter Is used. 



Applying a notch filter may further involve producing an adjusted set of notch 
filter parameters and applying a notdi filter employing the adjusted set of 
notch filter paramieters to the selected region. The set of notdi filter 
parameters may be adjusted to pnxktce an adjusted set of notch filter 
parameters that may be appHed to the notch filter to filter the selected region 
until a sum of the absolute values of areas defined by peaks above and below 
a baseline of the initial spectnim is minimized. In this embodiment this is 
done by block 94 which causes the workstation to adjust the set of initial notch 
filter parameters and re-af^ the notch fitter until the sum of the absdute 
values of the areas of the spectnim in the selected region, is minimized. One 
quick way of doing this and minimizing the number of iterattons of appHcatton 
of the notch filter is to empk^ num^cal methods to successive values 
produced. For example, in this embodimmt. using the Varian (RTM) VNMR 
software, the parameter "sslsfrq" specifies a nokh filter value that affects the 
minimization of the sum of the areas above and beknv the baseline. Brent's 
method, as described in Brent. R.P. 1973. Algorithms for Minimization without 
Derivatives (Englewood (Ms, NJ: Prentice-HaH). Chapter 5, {1J. for example 
may be used to find an optbtHim value for "sslsfrq". 

Referring back to Figure 2, after fiRering ttie selected region bk>ck 68 is 
invol(ed to automatically phase the entire spectrum and make the peaks as 
symmetrical as possible. This may be done iterativeiy. for example, by 
adjusting tiie real and fenaginary components of the tiansfomned FID data until 
the resulting spectnim has positive, well defined peaks. In tiits embodiment, 
employing the Varian (RTM) VNMR software, ttiis is achieved by invoking 
btock 84 whfch calls ttie "aphO" command of ttie VNMR software. Some 
versions of the VNMR software may require more ttian one successive 
execution of the aphO command. 

After automatic phasing parameters of the spectium have been produced, 
optionally, a baseline correction block 69 may t>e executed to flatten out the 
baseline of ttie spectrum. Altematively, baseline correctkm may be performed 



later. Baseline conrectlon may be done by analysing the spectrum to 
determine areas with peaks and areas devoid of peaks and setting areas 
devoid of peaks to have a common intensity value such as zero, for example. 
An example of baseline conectk^n available at 
www.acdlabs.com/publish/nmr_ar.html published by Advanced Chemistry 
Development Inc. of Toronto, Ontario, Canada. 

Block 56 then causes the workstation 16 to save parameters produced by the 
various sub-processes of bk)ck 54 In assodatfon with the FID data and text, if 
desired. With the Varian (RTM) VNMR software this may be achieved using 
the '8vf($8avefid)' command. 

Block 58 then directs the woricslatton 16 to produce a trace file comprised of 
(x.y) values representing intensfty versus frequency, by pertbrniing a Fourier 
Transfom™ on the FID data, using the parameters produced as described 
above and associated with FID data. The trace file is then transferred or 
transmitted to the spectmm analysis apparatus 14 or is stored for later 
transfer to that apparatus. 

gpectwm Analvsh Arm^rs^^^ 

In the embodiment shown, the spectrum analysis apparatus (SAA) 14 is a 
separate component and includes a Unux woricstation configured to receive 
the trace file representing the measured spectmm. from the spectmm 
producing apparatus 12. The spectrum analysis apparatus 14 is configured to 
receive and execute lnstnictk)ns embodied in computer readable codes to 
carry out a process for Wentifying compounds in a sample according to an 
embodiment of another aspect of the im^ntfon. The codes may be prevWed 
to the specbrum analysis ^raius through any of the media described above 
Including the CD4?0M 26. Ftoppy disk 28. intemet 30. extranet. Intranet, 
etectrical 32. RF 34. and optteai 36 media and/or any other media capable of 
provWing codes to the spectmm analysis apparatus. 



it wHI be appreciated that the workstation 1& may altematively be configured 
with both the codes to effect the process for producing a measured spectnim 
shown in Figure 2 and the codes to effect the process for identifying 
compounds, or either of these. It is desirable however, to execute the 
5 process for identrfying compounds at a computer other than the workstation 
16» to enable the process for identifying compounds to be executed whBe 
another sample is being subjected to the NMR process, for example. 

Process for identBvinQ Compounds 

10 Refening to Figure 5, generally, the process for identifying compounds 
involves identifying representative reference spectra from a set of refersnce 
spectra associated with detectable compounds and selected according to a 
condition of the sample, which collectively define a composite reference 
spectrum having features matching a set of features in a test spectrum 

15 produced from the sample. Once the representative reference spectra have 
been identified, compounds with which they are associated may be identifted. 

The compounds associated with respective reference spectra of the klentified 
set are the compounds that may be expected to be present in the sample. 

20 Quantities of the compounds may be detenmined from the intensities of 
certain representative peaks in the test spectrum which are associated with 
the compounds, relative to the intensity of a peak associated witti tiie 
chemical shift calibration standard compound which is unaffected by the 
condition of the sample. A condition may be the pH of the sample, for 

25 example, and an accurate measurement of pH can be obtained from the test 
spectmm. Thus, given a test spectmm of a sample and given a set of 
reference spectra, the process can identify and quantify compounds present 
in ttie sampfe. Altematrvefy, the condition may be temperature, osmal^, salt 
concentiation. chemical composition, or solvent, for example. 
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Reference Spectra 

Before the process for identifying compounds can be carried out. a set of 
reference spectra for compounds to be detected In the sample must be made 
available to the SAA 14. This can be done by storing data relating to 
reference spectra associated with respective compounds and allowing the 
SAA 14 access to the data. An exemplary reference spectium for a given 
compound may Initially be represented In the form of intensity versus 
frequency (x.y) values, which may be represented graphically. A reference 
spectmm for lactic acid Is shown in Figure 8, for example, tt win be 
appreciated that such a spectmm may have a plurality of peaics and/or 
clusters of peaics 150. 152, 154. 156. 158. 160, 162. 164, 166. 188, 170 
superimposed upon a featureless background, such as noise 172. The 
resolution along the x axis is dependent upon the frequency of the Magnet 
used in the Nuclear Magnetic Resonance Process emptoyed to acquire the 
sample. The peaics that are associated vi/ith lactic acid are found in first and 
second dusters 166 and 154. These dusters are centered at 1.322 ppm and 
4.110 ppm respectiveiy. The first duster is comprised of two peaics and the 
second duster 154 is comprised of four peaics. 

A reference spectrum of the type shown In Figure 6 can be represented In 
various fonnats induding mathematical representations such as Lorentzian 
equations whfch may specify peaics associated with the compound the 
spectram Is Intended to represent. Such equations have the fomi: 

f(x)s aw* 
w**4(x-c)* 

where: a represents amplitude of the peak 
w represents width of the peak; and 
c represents the center of the peak 



Thus, for example, the two peaks assodated with the duster centered on 
1.322 ppm may be spedfied t)y two sets of LorentzJan line shape parameters 
a, wand c. 

The Lorentzian line shape parameters for eadi peak associated with a ghren 
compound may l>e stored in a base reference spectrum record embodied in 
an XML file as shown in Figures 7A and 7B, for example. Such a file may 
have fields 200. 202 and 204. for ^cample, for storing compound information, 
experiment information and duster/^ic information respectively. The 
compound information field may include sub-fields for storing the name of the 
compound with which the reooid is associated, and the molecular weight of 
the compound, for example. The experfonent field may have sub-fields for 
storing Informatfon about the experiment, such as conditions under which the 
peak infonnatfon about the compound w^ collected. This may indude the 
pH of the solution that was analyzed, the temperature of the sdutton. the 
calibratton reforence compound ratio, the concentration of the compound in 
the soiutton, a timestamp, a sourcefile name, the frequency of the magnet 
used in the NMR process, and the spectral wkith of the entire spectrum, for 
example. The duster/0eak infonnatton fields may include separate fleMs 206 
and 208 for each duster (166 and 154 in Figure 6). 

Each duster field 206 and 208 may indude sub fiekis 210, 212, 214, 216 and 
218 for representing infonnatfon relating to the proton number of the duster, 
the quantification erf the duster, the Lorentzian line width adjustment of the 
duster and first and second peak subfields respectively. The first and 
second peak subfields may indude fields 220, 222, and 224 for representing 
offset center information, height informatfon and proton ratio infomnation 
relating to a respective peak in the duster, rsspectivefy. 

Effectively, the Lorentzian line shape parametere (a) and (c) for each peak 
may be stored in the height and offset center fields 222 and 220 respectively 
and each peak in a given duster is conskJered to have the same width (w) 



Which is specmed by the content, of lh« Lomntzlan In. widlh adjust Hew 214 
associated with the duster. 

Refentag lo Figu« «. a pmcess by which base «(«ence spectmm recent 
may be p«,duced Is Shown generally a. m The p»<»ss begins wHh blod, 
232 tepres«rtlng the p«pe«|o„ of . PquM solution containing . refe™™» 

"mpound such as lactic acid, a calihalion compound as DSS and a 
condition indicator «mpo«,d such as Wdazole. Tl* liquid solution I. 

pmpared to a carefulVcalihatedco™»nt»tlon of the caiibrrtion compound at 

a^«fUlly oontmiw ,»„pe«„„ 3nd PH. This «ep is oanfed out In a 

laboratory, byahunw, or by.mschanteedpnoe.Mbrex.mple. 

Onos liquid solution cont-ning .he. raferance. compound has been 
P™*.«d. a. «K«n in block 234. it is .ubMed to the f««R p»oce^ 
out by the appaatus 12 shown m FIgura 1 to produce RD data. 

« btodt 236. the apparatus 12 a*iscts the FID dab produced by the NMR 
P»c«s to »e proc«» Shown in 2. to , ^ 

spectaim. 



Hav«9 otaahed a messurad raltoenc. spwnm, a process as Shown in 
bto* 23. I. mitto.^, to kWiy the ca«bra«on expound and obtain 
«««»3tK«. parametera. TXIs pmces. is shown In greater detail a. 23. in 
Flflura.. to Fl^ra».lhecodssdlract the SM 14 to derive from 

the measurad rs«s»n» spectmm a characterisation of the callbra«on 
compound conta»»d In the s»npfe This involves 1^^^ 

l»ak of the measured rrtrence spectmm that meets a set of Criteria that 
W»oclate the peak With the ca»ratk,„ compound and Ibrther Involve. 
P«<»i«lng parametM Ibr a mathematical model of the peak, mat best 
"Prawnb <h. peak. ThuMn Ihis embodiment the characterizatfon i. a IW of 
Lor««zl«, una *ape parameter, (w, c and a) rapresentlng wkfth. peak 
posltK)n and centw amplitude respectively of a l^«nl2ian curve t^ 



describes a feature, that is, a peak, of the measured reference spectrum, that 
is assodated with the calibration compound, it will be appreciated that other 
characterizations could be used, such as those produced by peak picking, 
linear least scipjares fitting, the Levenberg-Marquardt method, or a 
5 comt^atton of these methods. 

Tofuftd a peak associated with the calibratton compound and to produce a list 
of Lorentzlan line shape parameters that characterize it, the SAA 14 is 
programmed with codes that include a first Mock 250 that directs the SAA 14 

10 to determine a noise level at a pre-defined area of the measured reference 
spectnim. In this embodiment, it known that an area on the x-axis 
(frequera^) corresponding to posittons 64,000 and 65.000 for example can.be 
expected to be vokl of peaks and contain only noise. The standard deviatkm 
of the y^value (s^nal intensity) over this region of the measured reference 

15 spectnim is representative of the noise level of the entire spectrum and 
ppovMes a measure of the noise level. 

Next Uock 252 directs the SAA 14 to scan the measured reference spectoim 
in the negative x-directton beginning at the higher order end of the spectrum. 
20 to find a y^value that meets a certain criterion. For example, the criterton may 
be that the y-value must exceed the noise level by a pre-determined amount, 
such as a factor of 10, at the top of a peak. A y-value meeting this criterion is 
assumed to be associated with an x-vahie that represents the positton of a 
peak associated with the c^bration compound. 

25 

Blod( 254 then directe the SAA 14 to employ the x-value representing the 
approximate positton of the calibration peak in the test spectmm in a fttting 
algorithm that fits a curve to the calibration peak and specifies wkith, height 
and position values. For example a Lorentzian line shape-fitting algorithm 
30 may be employed to produce Lorentzian line shape parameters (a. w and c) 
that d^ne a Lorentzian line shape that best matches the calibratk>n peak. 



Referring back to Figure 8. having calculated Lorentzian line shape 
parameters that Identify and characterize the calibration compound, block 240 
is carried out to associate other Input data with the measured reference 
spectrum. Other Input data may include Infbmiatfon associated with the name 
and experiment fleMs 200 and 202 and infomnatton such as the number of 
protons (proton number in XML file) for each duster and the proton ratk) for 
each peak, for example. 

Next at bkjck 242. the measured reference' spectrum is characterized by 
emptoying the well-known Conjugate Gradient method to detemiine 
Lorentzian line shape parameters (a. w and c) for each peak or to determine 
sets of such paramelere that define a mathematteai model or models of peaks 
that best fits the Important peaks of the measured reference spectrum. 

At btock 244, a base reference spectrum record of the type shown in Figures 
7A and 7B is produced firom the other input data and the charaderizatton of 
the spectrum. At btock 248. the base reference spectrum record is stored in a 
refiwence record library, which effectively Includes a piuralHy of reference 
records for various different reference compounds. For example, the 
refsrence record library may indude base refierence spectrum records for L- 
Phenylalariine. L-Threonine. Glucose. Citric Add. Creatinine. Dimethyfamlne. 
Glydne, Hippuric Add. L-Alanine. L-HlstWIne. L-I^ctfe Add. L-Lysine, L- 
Serine. Taurine, Trimethylamlne, Trimethylamine-N-Oxide. Urea, L-Valine, 
and Acetone. 

Reference records may indude base reference records or derived reference 
records. Base reference spectoim reconJs may be produced by empirical 
processes as described above. New records known as derived reference 
records may be produced by operating on data from base reference records, 
and represent derived reference spectra. Operattng on data may indude 
Interpolation and/or perfomiing mathematical operations, and/or using a 
lookup table, for example. Thus, for example, a limited set of base reference 



spectrum records can be produced, including a record representing the 
spectnjm for lactic acid at a ptH of 5.1, and a record for lactic add at a pH of 
SA$, for example. A derived reference recoid representing the spectrum of 
lacfic add \n a solution having a pH of 5.28, U» example, can then be 
produced by perlbnning mathematical operations on the Lorentzian line shape 
parameters specified by the base records assodated with solutions at pH 5.1 
and pH 5.45 Id interpolate values for a solution at a pH of SJ2B. Thus^ a 
derived set of reference records can be produced for sdutions of any pH, 
within a reasonable range, when required, thereby avoiding a priori production 
of base reference! records for every pH condition. As will be appreciated 
below, this feature may be exploited by determining the f>H value of a sample 
under test and usinjg the detemnined pH value to produce a set of derived 
reference records for use in identifying compounds present in the sample. In 
other words, reference records for use in the process for identifying and 
quantHyIng compounds are selected frnn existing base reference records or 
are 'selected" by producing derived refersnce records. accordNig to a 
conditton of the sample. In this embodiment, tte condition is pH. 

Process for Identifying Compounds 

After having produced a reference library of base refwnence spectnjm records, 
the process of tdentHying and quantifying compouhds in a test sample can be 
carried out 



Process for IdentifYin/, and Quatlfying nr.mp^.^A^ 

The process is shown generally at 300 in Figure IDA and 10B and begins 
with an optional first block of codes 302 that cause the SAA to perform a 
spectmm conditioning step. 

Spectnjm Conriftb^mng 

If the measured NIVIR spectmm of the test sample is of sufficient quality, it can 
be used directly in subsequent operations of the process disclosed herein 
However, usually, the measured spectmm wiH not be of sufficient quality and 
wBI require further processing to condition it for later use. This further 
conditioning may im«)h« baseline conection as described earifer. for 
example, to produce a conditioned spectmm. 

Thus the following description will refer to a test spectmm. which may be the 
measured spednim described above, if such measured spectmm is of 
sufficient quality or it may be a conditioned spectmm; A measured spectrum 
having a corrected baseline, for example, would be an example of a 
measured spectrum that would not need to be subjected to further processing 
to condition ft Usually, however, the process wn involve producing a test 
spectmm from the measured spectmm. 

Calibration D etenminatlr^ 

After being provided with, or after producing, a test spectmm of the type 
descra)ed. the process Im^lves block 304 to produce a characterization of a 
calibratton compound m the sample or block 306 to determine a 
representatton of a condition of the sample. These two fiinctfons can be done 
independently or the detemiinatton of the condltfon of the sample can be 
detennlned after first characterizing the calibratton compound. 

The process of characterizing the callbratfon compound generally Involves 
Identifying a peak associated with a calibration compound. In the test 
spectmm. This may involve identifying a peak meeting a set of criteria that 



associate the peak with the calibration compound. The pealc associated with 
the calibratton compound may be characterized by producing Lorentzian tine 
shape parameters to represent the pealc. 



5 Blodc 304 relating to characterizing the calibration compound involves a call 
to the process shown in Figure 9 to cause the SAA 14 to produce a set of 
Lorentzian values (a» w and c) which best represent the pealc associated with 
the calibration compound in the test spectnim. 

10 Condition Factor Determination 

OpUonaliy, as shown by block 308, a separate measuring device may be used 
to measure the selected condrtkm of the test sample. In this embodiment the 
measured conditton is pH which may be measured by a separate pH meter to 
produce a pH conditicMi value that may be supplied to the SAA as indicated at 

15 "C* in Figure 10. for use in later functions of the process. 

If the conditton value has not already been obtained desirably the conditton 
value can be derived from the test spectrum itself as shown at block 306. 
This is possible where the measured condition is pH because the 
20 identifk^tton of a peak associated with a pH indicator compound in a sample 
can be readily detennined from tiie test spectrum and the Lorentzian line 
shape values that charactertee the representetion of the calibration compound 
in the test spectrum. 

25 Refening to Figure 11, a process for detemnlnlng a pH conditton value from 
the test spectnjm is shown generally at 310. Basically, the process involves 
identifying a position, height and wkith of a peak associated with a condition 
reference compound in the test spectrum and this may Involve identifying a 
peak meeting a set of criteria that associate the peak with the condition 

30 reference compound. Once the peak is identified the measured condition 
value may be produced as a function of the peak position and parameters of 



the sample medium, the parameters being the parameters that define the 
calibration compound. 



To achieve this. In this emtxxilment, the codes include a block 312 which 
directs the SAA 14 to employ the Lorentzian line shape parameter (c) 
associated with the calibration compound to locate a window in the test 
spectnim. where a peak associated with the pH indicator compound is 
expected to be. The window Is then scanned atong the x^xls (frequency) 
from left to right, for example, for a y^value (intensity) that is greater than the 
amplitude value specified by the Lorentzian line shape parameter (a). 

When a y^value meeting the above criteria is found. bk>ck 314 causes the 
SAA 14 to execute a characterizatton algorithm to produce at least a center 
value (c) representing the center of the peak associated with the pH reference 
compound. For example a Lorentzian curve algorithm may be used to 
produce Lorentzian parameters a. w and c defining the peak associated with 
the pH reference compound. 

Btock 316 then directs the SAA 14 to execute a modified pH titretton Equatfon 
as shown betow. on the center value c and to use certain parameters of the 
sample solvent, in the equation, to produce a conditton value representing pH 
of the sample: 

Where; ^ is the observed chemical shift (center c); 

5a is the chemical shift of the conjugate base; 
6ha is the chemical shift of the conjugate acid; and 
pl^ is an association constant fbr the conjugate base. 



Assume that no matter what method of detemiining pH is used, a pH value of 
5.28 is obtained for the sample. Referring Back to Figure 10B block 320 
directs the SAA to receive the condition value either produced externally, such 
as by measurement or produced intemally such as by using the test spectrum 
as described above, to produce a derived reference record representing a 
derived reference spectrum for use in later functtons of the process. Separate 
derived reference records may be produced from corresponding base 
reference spectrum records associated with corresponding compounds 
expected to be in the sample. Thus, in effsct a rapresentatton of a set of 
derived reference spectra may be produced from a set of reference spectra 
and the measured conditton value. In general, a process for producing a 
representatton of a spectrum for a hypothetteal sotutton containing a 
compound, for use in detemnining the compositton of a test sample, invoh^es 
producing a positk>n value for at least one peak of a reference spectrum as a 
function of the measured conditfon of the test sample and a property of the at 
least one peak In a base reference spectrum. The property may be a positton 
of a peak, amplitude of the peak or wkJth of the peak for example. In this 
embodiment, a derived reference record is used to represent a representatton 
of a spectrum for the hypothetical solution. 

Refem'ng to Figure 12. producing a derived reference record may involve 
accessing a pre-defined record specifying peaks in a reference spectrum and 
adjusting a positton value in the record, the position value being the position 
value of the at least one peak. This may be done by block 322 whteh causes 
the SAA to identify a base reference spectnim record that is associated with a 
condrtton nearest to the measured condition of the sample and to use such 
reference spectrum as the derived reference spectrum. 

Producing a position value for a peak may invoke interpoiating a position 
value from position values associated with base reference spectra associated 
with condition values atK)ve and below the measured conditton value 
associated with the sample. For example, block 324 may be employed to 



cause the SAA 14 to produce a position value by calculatlhg the position 
value as a function of pH of the sample and to effectively produce or 
interpolate a derived reference spectnjm. 

To interpolate a derived reference spectrum, assume that at block 322 a base 
reference record for lactic acid at a pH of 5.10 is located as being the base 
reference spectrum record for lactic add that Is nearest to the pH of the 
sample. 5.28. Such a record is shown In Figures 7A and 7B. Referring back 
to F,gu,B 12. block 324 may direct the SAA 14 to find another base reference 
spectrum record for lactic acW that Is associated with a pH value greater than 
the PH of the sample. Assume that It tocates a base reference spectrum 
record associated lactic acid at a pH of 5.45. A record of this type Is shown In 
F^urBs 13A and 13B. On tocating this second base reference spectmm 
record, btock 324 directs the SAA 14 to create a new derived reference 
spectnim record fbrlactlc ackJ at a PH of 5.28. To do this the SAA 14 is 
directed to make a copy of the base reference spectrum record associated 
wrth a PH of 5.45 and then to replace the frequency values for the center 
posrtion of each cluster shown In that record with interpolated values A 
single linear Interpolatfon Is used to find the value 1.3202 for the first cluster 
and the value 4.1149 for the second cluster. Figures 14A and 14B show the 
resulting derived reference spectrum record for a pH of 5J58. for lactic acid 
produced using this method. SImllariy. derived reference spectrum records 
are produced for each compound In the reference library to produce derived 
reference records for a pH of 5.28 for each compound represented in the 
library. 

Alternatively, adjusting the posrtfon of a peak may involve locating a 
measured condltfon value dependent functfon in a base reference record or 
prsKfefined record, producing the position value from the functfon and 
associating the posltfon value with the pre^lefined record. Associating may 
involve Storing the positfon value In the predefined record, for example To 
eflectfhis method of adjusting the position of a peak, a generic type of derived 



record may be kept, in which equations, effectively specifying the centerPPM 
values for the two clusters as a function of pH may be provided in the field 
associated with the centerPPM value for each duster, as shown in Figures 
ISA and 15B. Then, whenever a pH value is found from a sample, a copy of 
5 the record can be made and the pH value may be used in the equations in the 
copied record to produce centerPPM values. Th^ centerPPM values can 
then be substituted for the respective equations that produced the. In the 
copied record, thereby producing a new derived record for use in iat^r 
calculations. 

10 

Altematively. producing a position value may involve producing the position 
value by addressing a lookup table of positton values with the measured 
condition value of the sample. For example the positiori value of a peak may 
be adjusted by locating, in a pre-defined record, a link to a lookup table 

15 specifying peak positions for vartous condition values, retrieving the positicNi 
value from the tookup table and associating ttte position value with the pre- 
defined record. To do this a second generic type of derived record may be 
kept. In which lookup table' links, effectively specifying links to lookup tables 
(not shown) that return centerPPM values for input pH values may be 

20 provided in the field associated witii the centerPPM value for each cluster, as 
shown in Figures 16A and 16B. Then, whenever a pH value is found from a 
sample, a copy of the record can be made and tiie pH value may be used to 
address the lookup tables associated witti the links specified in the record to 
produce centerPPM values. These centerPPM values can tiien be 

25 substituted for the respective links tiiat produced them, in the copied record, 
thereby producing a new derived record for use in later calculations. 

Refem'ng back to Rgure 10B. after having produced a derived reference 
spectrum for each compound that is likely to be in tiie sampte. block 326 
30 causes ttie SAA 1 4 to calibrate tiie Lx>rentzian line wkfth values for ihe derived 
reference spectrum relative to tiie test spectrum to provide for a better fit to 
tiie test spectrum. To do this, bk)ck 326 may direct ttie SAA 14 to calibrate to 



the (a. c and w) values assodated with the calibration compound in that 
sample, the spectral llnewidths of peaks associated with each of the reference 
compounds. In this embodiment block 326 may direct the SAA 14 to emptoy 
the contents of the Lorentzian wWth adjust field 214 of each derived reference 
spectnjm record to produce respective absolute values representing actual 
linewklths relative to the caiibratton compound linewidth. These modified 
spectral line widths may be associated with respective peaks in the same 
cluster of each reference compound, by storing these modified spectral line 
widths in an internal data stnicture (not shown) that associates modified 
spectre! information with derived r^rence records. 

Still referring to Figure 10B. optionally, compound specific adjustments as 

shown by biocit 328 may be made to the contents of the fields of the derived 
reference records, where it is known, lor example that certain effects occur 
when certain reference compounds are present in the test sample. For 
example, the shift of peaks associated with citrate is affected by the presence 
or absence of certain divalent catkms and therefore the process may include 
a compound-specific adjustment to compensate for shifis known to occur 
when the presence of such divalent cations is known. Other compound- 
specifkj adjustments may be made to compensate for shifts due to 
temperature, chemical interactfc>ns. dHutkm effect and other llgand effects. 

austerCentarinq 

SUM referring to Figure 10B the process may further invoh^e a cluster centering 
step as shown at 330 for shifting the derived reference spectnim In frequency 
(x-directlon) to better align it with the test spectrum. This may involve 

producing a duster positfon indicator for a derived reference spectmm. which 
causes the positions of peaks in the derived reference spectrum to match 
corresponding peaks in the test spednim. A duster position indicator already 
associated with the derived refiBrence spednjm may be used or a duster 
position indicator that produces a match of the derived reference spectrum to 
the test spectmm to a defined degree may be derived from the cluster position 



indicator already associated with tlie derived reference spectnim. In the 
embodiment shown, producing a cluster center indicator is achieved by 
attempting to fit the cluster to the test spectrum. To do this, cluster center 
values around the cluster center value already associated with the derived 
reference spectmm are assigned to the derived reference spectrum and used 
to effectively shift the derived reference spectrum to the left and rfght of the 
current cluster center value. For exanaple, cluster center values +/- 0.001 
ppm points are successively assigned to the derived reference spectrum to 
successively shift the center of the derived reference spectrum at successh^ 
points in a window extending -0.003 ppm to ■H).003 ppm from the currently 
assigned cluster center. At each point, the derived reference spectmm is 
used in a Levenberg-Marquardt (LM) fitting algorithm that detennines a 
correlation value for each position of the center of the derived reference 
spednjm In the window! The center position that causes the LM fitting 
algorithm to produce the t>est correlation value is then associated with the 
derived reference spectnim correlation value and Is used In later calculations. 
Thus in effect, the derived reference spectmm is "wiggled" into alignment wtth 
the test spectmm. This wiggling is done independently for each cluster of 
peaks in the derived reference spectmm. ' 

Upper Boun d Concent^tion Estimates 

Stn refening to Figure 10B, in this embodiment, the process for Identifying 
and quantifying further invokes block 332 which causes the SAA 14 to 
produce an upper bound estimate of a quantity of a compound associated 
with a derived reference spectmm. (or use in a least squares algorithm later in 
the process. In general, producing an upper bound concentration estimate 
comprises setecting as the upper bound concentration estimate, a lowest 
concentration value selected from a plurality of concentration values 
calculated from respective peaks in the test spectmm. This may involve 
finding the height of a peak in the test spectmm that corresponds to a peak in 
the reference spectmm and detemiining a concentration value for the peak as 
a functfon of ite height Prior to detennining a concentration estimate for a 



peak, the process may involve predicting whether the height of a peak in the 
test spectrum is greater than a threshold level and deciding not to determine a 
concentration for the peak when the height is less than the ttireshokl level. 

Referring to Figure 17 a process Implemented by program codes operating on 
the SAA 14 of Figure 1 . for producing an upper bound concentiatfon estimate 
is shown generally at 340. A first block 342 causes the SAA 14 to select a 
reference record. Next btock 344 causes the SAA 14 to sort by height those 
peai<s in the reference record that have a quantiffeation value equal to 1. This 
causes the process to conskJer only those peaks that provide r«liabte 
concentration estimates. Next, block 346 directs the SAA to address the 
(next) highest peak of those that have just been sorted at bk>ck 344 
Reference is made to the "nexT high peak .because the peaks are considered 
in successfon. On the first pass through the process however, the highest 
peak found in the sort is the first peak addressed. 

Next block 348 causes the SAA 14 to use the position of the currently 
addressed peak in the reference spectmm to tocate a corrBsponding peak in 
the test spectrum. This may involve looking for a peak in a window positk>ned 
at a corresponding positton in the test spectrum. On finding such a peak, the 
maximum intensity value (max(y)) associated with that peak is found. 

At block 350. the SAA 14 is directed to calculate a concerrtratfon value as a 
functton of the max (y) value, using the foltowing equatkm: 

Ct - adiUSledwidth ' mavfy) * H ^ncentfatinn * dssp mfnnrati^ ^^7) 

Ossheight ' peakprotonratto 



Where: 



Ct 

adjustedwkith 



is the concentratfon value for the peak 

is the wWth of the peak as determined from 

the variable w calculated as shown in 



dssconcentration 



dssprotonratio 
Dsshetght 



Peakprotonratio 



max(y) 



Figure 9 and the Lorentzian width adjust 
value stored in the reference record 
is the maximum y-value assodated with the 
con'esponding peak in the test spectrum 
is the concentratton of DSS in the sample 
O.SmM, for example 

is the DSS proton ratio (9, for example) > 
is the OSS height value a. cateulated as 
shown in Figure 9 

is the proton ratio of the peak, as indicated 
in the reference record. 



At bkx:k 352 SAA 14 is directed to detemnrne whether the currently, cateuiated 
concentratton value is less than the prevtously calculated value, if so» tiien 
block 354 causes the SAA 14 to set a preliminary upper bound concentratton 
value to the current concentration value. If at block 352, the currently 
calculated concentratton value is not less than the prevtously calculated value, 
the preliminary upper bound concentration estimate value remains at its 
former value. The effect of blocks 352 and 354 is to cause the preliminary 
upper bound concentration estimate to be set to the towest concentration 
value calculated for any of the peaks. 

Once the preliminary value has been detenmined from the cunrent pass, block 
356 directs the SAA 14 to determine whether ail peaks with quantification 
values of 1 have been considered. If so, the SAA 14 is directed to optional 
block 357 in Figure 17. if not, the SAA 14 Is directed to btock 358 which 
causes the SAA to calculate the expected height of the next peak associated 
with the compound, in the test spectrnm. To do this equatton 17 above is 
solved for max(y) using the cunent preliminary concentratton estimate, and 
the Lorentzian wkith adjust value, and the peak proton ratk) of the next 
highest peak from the list of sorted peaks. Then, block 359 in Figure 17 
causes the SAA 14 to detemnine whether the max(y) value so found is less 



than the noise level of the spectnjm. (noise level was calculated at block 250 
in Figure 9). If not. then the next peak is worth consWering and the SAA 14 Is 
directed to resume processing at block 346 to address the next highest peak 
in the sorted list 

If the estimated height of the next highest peak found at block 358 is less than 
the noise level of the spectnjm. the SAA 14 is directed to an cpttonal btock 
357 which increases the amplitude of the preliminary concentration estimate 
value by the amplitude of the noise in the test spectnjm to produce a tme 
estimate of the upper bound concentratton limit for the compound. This is 
useful where concentratkm values are vwy tow. 

Then, finally, block 355 directs the SAA 14 to associate the true upper bound 
concentration estimate with the reference record, such as by storing the upper 
bound concentration estimate value In a fieM (not shown) of the record, or in a 
fieW of a data structure maintained in ttie SAA 14 to create such associations. 

Least squar es Fittina 

Refemng back to Figure 10B. the process for Mentifying and quantifying 
compounds involves a block 334 which causes ttie SAA 14 to perform a least 
squares fitting algorithm using aU of ttie derived reference records and ttie test 
spectmm to produce scaling vahjes for each peak m each reference spectnjm 
such tiiat when ail peaks from an reference spectre are summed ttiey produce 
a composite spectrum tfiat best matches flie test spectium. 

Refening to Figure 18. ttie least squares fitting routine includes a firet btock 
360 which causes ttie SAA 14 to produce "signature" spectra comprised of 
(x.y) paire ttiat define a composite spectnim representative of ttie sum of all 
Lorentzlans in a gh^en derived reference record. A separate signature 
spectnim is produced for each derived reference record. Thus a separate 
(x,y) anay is produced for each derived reference record. 



Block 362 then provides each signature spectmm, upper bound 
concentrations and the (x,y) array representing the test spectrum to a Linear 
Least Squares Fitting routine, which in this emt>odiment is LS SOL licensed 
from Stanford University of California, USA. This routine returns scaling 
factors for each peak in each applicable reference record, such that when the 
scaled Lorentzian models specified in all applicable reference records are 
sumnied together to make a com(K>site spectmni, the composite spectaim 
has features matching features in the test spectrum produced from the 
sample. These scaling factors thus kJentify representative reference spectra 
from a set of reference spectra associated vi^ detectable compounds and 
selected according to the measured oonditton of the sample. 

In ttiis embodiment, an indtoation of compounds assodated with reference 
spectra having peaks that when scaled by the scaling factors have a height 
greater than a threshold may be produced. This may involve producing a list 
of compounds, for example. Thus, scaled peaks having a height less than 
the threshold may indicate that the presence of the associated compound in 
the sample Is questtonable and therefore such compound should not be listed 
as being present in the sample. 

Bkxk 364 then causes the SAA 14 to employ these scaling factors in the 
following equatton to quantify each compound by producing concentratfon 
values for each compound represented by a reference record: 

Cone. = (DSSRatk) * scalingFactor * cdb) / pxDSS 

Where: Cone.: ooncentratfon of the given compound in the 

sample 

DSSRatto: the DSSRatto entry for the given compound 

(see fiekl 202 in Figure 7A) 

scalingFactor the scaling factor of the highest peak in the 
given compound (from least squares fitting) 



the concentration of the gh/en database 
entry (see field 202 in Figure 7A) 
pxDSS: the pixel height of DSS in the spectrum (the 

value a as detennined by the process 
shown in Figure 9) 

aock 366 then causes the SAA 14 to associate these concentration values 
with the compounds associated with the derived reference records. 

Blodc 368 then causes the SAA 14 to produce a list or indication of 
compounds in the sample, along with their associated concentration values 
Th« list may be printed and/or displayed on a monitor, for example' 
concentration values may be expressed in^ moles. mmol/L. g/L or moles/mole' 
for example and absolute quantities may be obtained by a simple equation' 
converting concentration to absolute quantity values. In moles, for example. 

While specie embodimente of the invention have been described and 
.llustrated. such embodiments should be considered iHustrative of the 
.nventlon only and not as limiting the im/ention as construed in accordance 
with the accompanying daims. 



CLAIMS 



A computer-implemented process for producing a trace f9e for use in 
spectrum analysis, tiie method comprising: 

performing a Fourier Transform on Free Induction Decay (FID) 
data to produce an Initial spectmrn; 

filtering a selected region of said initial spectrum to produce a 
filtered spectrum; and 

phasing said fWered spectrum to produce a measured spectrum 
having a flat t)aseiine and positive peato. 

wherein performing a Fourier transform comprises performing a 
weighted Fourier Transfonn with weights that provide for enhancement 
of said initial spectmrn. 

The method of daim 1 wherein filtering comprises applying a notch 
filter to said selected region to suppress a pealc associated with a 
contaminant in said contaminant region. 

The method of daim 2 wherein applying a notch liter comprises 
produdng an adjusted set of notch filter parameters and applying a 
notch filter employing said adjusted set of notch filter parameters to 
said selected region. 

The method of claim 3 wherein applying a notch filter comprises 
tteratively adjusting said set of notch filter parameters and applying 
said adjusted notch filter parameters to a notch filter and applying said 
notch filter to said selected region until a sum of the at>solute values of 
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